Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 2186 |
| Missing cells | 8045 |
| Missing cells (%) | 19.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.1 MiB |
| Average record size in memory | 523.8 B |
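
The missing-cell percentage can be re-derived from the other rows of this table, since the number of cells is the number of variables times the number of observations. A minimal sketch of that arithmetic, using only the figures reported above:

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 19
n_observations = 2186
missing_cells = 8045

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(total_cells)            # 41534
print(f"{missing_pct:.1f}%")  # 19.4%
```
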
Variable types
| Numeric | 13 |
|---|---|
| Categorical | 6 |
Alerts
| Alert | Type |
|---|---|
| ReporterISO has constant value "CHN" | Constant |
| FlowDesc has constant value "Import" | Constant |
| PartnerISO has a high cardinality: 222 distinct values | High cardinality |
| Key has a high cardinality: 2186 distinct values | High cardinality |
| Country Code has a high cardinality: 201 distinct values | High cardinality |
| j has a high cardinality: 188 distinct values | High cardinality |
| RefYear is highly overall correlated with Period and 1 other fields | High correlation |
| Period is highly overall correlated with RefYear and 1 other fields | High correlation |
| Cifvalue is highly overall correlated with PrimaryValue and 5 other fields | High correlation |
| PrimaryValue is highly overall correlated with Cifvalue and 5 other fields | High correlation |
| year is highly overall correlated with RefYear and 1 other fields | High correlation |
| gdp is highly overall correlated with Cifvalue and 6 other fields | High correlation |
| population is highly overall correlated with Cifvalue and 3 other fields | High correlation |
| sum_pos_tweets is highly overall correlated with Cifvalue and 5 other fields | High correlation |
| count_tweets is highly overall correlated with Cifvalue and 6 other fields | High correlation |
| sum_likes is highly overall correlated with gdp and 3 other fields | High correlation |
| sum_retweets is highly overall correlated with Cifvalue and 5 other fields | High correlation |
| Country Code has 202 (9.2%) missing values | Missing |
| year has 202 (9.2%) missing values | Missing |
| gdp has 202 (9.2%) missing values | Missing |
| population has 202 (9.2%) missing values | Missing |
| j has 326 (14.9%) missing values | Missing |
| dist has 356 (16.3%) missing values | Missing |
| sum_pos_tweets has 1311 (60.0%) missing values | Missing |
| count_tweets has 1311 (60.0%) missing values | Missing |
| sum_political_tweets has 1311 (60.0%) missing values | Missing |
| sum_likes has 1311 (60.0%) missing values | Missing |
| sum_retweets has 1311 (60.0%) missing values | Missing |
| PartnerISO is uniformly distributed | Uniform |
| Key is uniformly distributed | Uniform |
| Country Code is uniformly distributed | Uniform |
| j is uniformly distributed | Uniform |
| Key has unique values | Unique |
| sum_pos_tweets has 141 (6.5%) zeros | Zeros |
| sum_political_tweets has 800 (36.6%) zeros | Zeros |
| sum_likes has 197 (9.0%) zeros | Zeros |
| sum_retweets has 229 (10.5%) zeros | Zeros |
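
As a consistency check, the per-column counts in the "Missing" alerts should add up to the 8045 missing cells reported under "Dataset statistics". The sketch below copies those counts from the alerts and recomputes each column's share of the 2186 observations:

```python
# Per-column missing counts, copied from the "Missing" alerts.
missing_by_column = {
    "Country Code": 202, "year": 202, "gdp": 202, "population": 202,
    "j": 326, "dist": 356,
    "sum_pos_tweets": 1311, "count_tweets": 1311,
    "sum_political_tweets": 1311, "sum_likes": 1311, "sum_retweets": 1311,
}
n_observations = 2186

total_missing = sum(missing_by_column.values())
print(total_missing)  # 8045

# Share of rows with a missing value, per column.
for column, count in missing_by_column.items():
    print(f"{column}: {100 * count / n_observations:.1f}%")
```

The five tweet-derived columns account for 6555 of the 8045 missing cells.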
Reproduction
| Analysis started | 2023-04-11 14:47:47.672790 |
|---|---|
| Analysis finished | 2023-04-11 14:48:11.143928 |
| Duration | 23.47 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
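
The reported duration is the difference between the two timestamps. A short sketch with the standard `datetime` module, using the values from the table above:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2023-04-11 14:47:47.672790")
finished = datetime.fromisoformat("2023-04-11 14:48:11.143928")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # 23.47 seconds
```
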
RefYear
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2016.511 |
| Minimum | 2012 |
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 2012 |
|---|---|
| 5-th percentile | 2012 |
| Q1 | 2014 |
| median | 2017 |
| Q3 | 2019 |
| 95-th percentile | 2021 |
| Maximum | 2021 |
| Range | 9 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.8745101 |
|---|---|
| Coefficient of variation (CV) | 0.001425487 |
| Kurtosis | -1.2252865 |
| Mean | 2016.511 |
| Median Absolute Deviation (MAD) | 2.5 |
| Skewness | -0.0067243127 |
| Sum | 4408093 |
| Variance | 8.2628085 |
| Monotonicity | Increasing |
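
The descriptive statistics for RefYear can be reproduced from the per-year counts listed for this variable. The sketch below is a plain-Python check, not the profiler's own code. It assumes the sample variance (dividing by n - 1), which is the convention that matches the reported value:

```python
import math

# Rows per year, copied from the RefYear frequency table.
counts = {2012: 218, 2013: 219, 2014: 217, 2015: 215, 2016: 219,
          2017: 219, 2018: 220, 2019: 220, 2020: 219, 2021: 220}

n = sum(counts.values())
total = sum(year * c for year, c in counts.items())
mean = total / n
# Sample variance: squared deviations weighted by count, divided by n - 1.
variance = sum(c * (year - mean) ** 2 for year, c in counts.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(n, total)            # 2186 4408093
print(round(mean, 3))      # 2016.511
print(round(variance, 4))  # 8.2628
print(round(std, 4))       # 2.8745
print(round(cv, 6))        # 0.001425
```
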
| Value | Count | Frequency (%) |
| 2018 | 220 | |
| 2019 | 220 | |
| 2021 | 220 | |
| 2013 | 219 | |
| 2016 | 219 | |
| 2017 | 219 | |
| 2020 | 219 | |
| 2012 | 218 | |
| 2014 | 217 | |
| 2015 | 215 |
| Value | Count | Frequency (%) |
| 2012 | 218 | |
| 2013 | 219 | |
| 2014 | 217 | |
| 2015 | 215 | |
| 2016 | 219 | |
| 2017 | 219 | |
| 2018 | 220 | |
| 2019 | 220 | |
| 2020 | 219 | |
| 2021 | 220 |
| Value | Count | Frequency (%) |
| 2021 | 220 | |
| 2020 | 219 | |
| 2019 | 220 | |
| 2018 | 220 | |
| 2017 | 219 | |
| 2016 | 219 | |
| 2015 | 215 | |
| 2014 | 217 | |
| 2013 | 219 | |
| 2012 | 218 |
Period
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2016.511 |
| Minimum | 2012 |
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 2012 |
|---|---|
| 5-th percentile | 2012 |
| Q1 | 2014 |
| median | 2017 |
| Q3 | 2019 |
| 95-th percentile | 2021 |
| Maximum | 2021 |
| Range | 9 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.8745101 |
|---|---|
| Coefficient of variation (CV) | 0.001425487 |
| Kurtosis | -1.2252865 |
| Mean | 2016.511 |
| Median Absolute Deviation (MAD) | 2.5 |
| Skewness | -0.0067243127 |
| Sum | 4408093 |
| Variance | 8.2628085 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 2018 | 220 | |
| 2019 | 220 | |
| 2021 | 220 | |
| 2013 | 219 | |
| 2016 | 219 | |
| 2017 | 219 | |
| 2020 | 219 | |
| 2012 | 218 | |
| 2014 | 217 | |
| 2015 | 215 |
| Value | Count | Frequency (%) |
| 2012 | 218 | |
| 2013 | 219 | |
| 2014 | 217 | |
| 2015 | 215 | |
| 2016 | 219 | |
| 2017 | 219 | |
| 2018 | 220 | |
| 2019 | 220 | |
| 2020 | 219 | |
| 2021 | 220 |
| Value | Count | Frequency (%) |
| 2021 | 220 | |
| 2020 | 219 | |
| 2019 | 220 | |
| 2018 | 220 | |
| 2017 | 219 | |
| 2016 | 219 | |
| 2015 | 215 | |
| 2014 | 217 | |
| 2013 | 219 | |
| 2012 | 218 |
ReporterISO
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 145.2 KiB |
| CHN |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 6558 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CHN |
|---|---|
| 2nd row | CHN |
| 3rd row | CHN |
| 4th row | CHN |
| 5th row | CHN |
Common Values
| Value | Count | Frequency (%) |
| CHN | 2186 |
Words
| Value | Count | Frequency (%) |
| chn | 2186 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 2186 | |
| H | 2186 | |
| N | 2186 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6558 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2186 | |
| H | 2186 | |
| N | 2186 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6558 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 2186 | |
| H | 2186 | |
| N | 2186 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6558 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 2186 | |
| H | 2186 | |
| N | 2186 |
FlowDesc
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 151.6 KiB |
| Import |
|---|
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 13116 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Import |
|---|---|
| 2nd row | Import |
| 3rd row | Import |
| 4th row | Import |
| 5th row | Import |
Common Values
| Value | Count | Frequency (%) |
| Import | 2186 |
Words
| Value | Count | Frequency (%) |
| import | 2186 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 2186 | |
| m | 2186 | |
| p | 2186 | |
| o | 2186 | |
| r | 2186 | |
| t | 2186 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10930 | |
| Uppercase Letter | 2186 | 16.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 2186 | |
| p | 2186 | |
| o | 2186 | |
| r | 2186 | |
| t | 2186 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 2186 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13116 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 2186 | |
| m | 2186 | |
| p | 2186 | |
| o | 2186 | |
| r | 2186 | |
| t | 2186 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13116 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 2186 | |
| m | 2186 | |
| p | 2186 | |
| o | 2186 | |
| r | 2186 | |
| t | 2186 |
PartnerISO
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 222 |
|---|---|
| Distinct (%) | 10.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 145.4 KiB |
| W00 | 10 |
|---|---|
| CUW | 10 |
| BES | 10 |
| NCL | 10 |
| VUT | 10 |
| Other values (217) |
Length
| Max length | 7 |
|---|---|
| Median length | 3 |
| Mean length | 3.0182983 |
| Min length | 3 |
Characters and Unicode
| Total characters | 6598 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
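
The length and character figures for PartnerISO are mutually consistent and show how many rows deviate from a three-character code. A small sketch of that arithmetic, using only values from the tables above. The last step assumes that every over-long value has the maximum length of 7:

```python
# Figures copied from the PartnerISO "Length" and "Characters and Unicode" tables.
n_rows = 2186
total_characters = 6598
min_length, max_length = 3, 7

mean_length = total_characters / n_rows
print(round(mean_length, 7))  # 3.0182983

# Characters beyond what n_rows three-character codes would need.
excess = total_characters - n_rows * min_length
print(excess)  # 40

# Assumption: all over-long values have the maximum length.
print(excess // (max_length - min_length))  # 10
```

The result of 10 rows agrees with the single seven-character entry, counted 10 times, in the Common Values table.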
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | W00 |
|---|---|
| 2nd row | AFG |
| 3rd row | ALB |
| 4th row | DZA |
| 5th row | AND |
Common Values
| Value | Count | Frequency (%) |
| W00 | 10 | 0.5% |
| CUW | 10 | 0.5% |
| BES | 10 | 0.5% |
| NCL | 10 | 0.5% |
| VUT | 10 | 0.5% |
| NZL | 10 | 0.5% |
| NIC | 10 | 0.5% |
| NER | 10 | 0.5% |
| NGA | 10 | 0.5% |
| 19,00 F | 10 | 0.5% |
| Other values (212) | 2086 |
Words
| Value | Count | Frequency (%) |
| w00 | 10 | 0.5% |
| bdi | 10 | 0.5% |
| dom | 10 | 0.5% |
| brn | 10 | 0.5% |
| alb | 10 | 0.5% |
| dza | 10 | 0.5% |
| and | 10 | 0.5% |
| ago | 10 | 0.5% |
| atg | 10 | 0.5% |
| aze | 10 | 0.5% |
| Other values (213) | 2096 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 488 | 7.4% |
| R | 480 | 7.3% |
| N | 426 | 6.5% |
| M | 418 | 6.3% |
| S | 395 | 6.0% |
| B | 359 | 5.4% |
| L | 348 | 5.3% |
| T | 320 | 4.8% |
| G | 320 | 4.8% |
| C | 289 | 4.4% |
| Other values (25) | 2755 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6422 | |
| Decimal Number | 136 | 2.1% |
| Space Separator | 20 | 0.3% |
| Other Punctuation | 10 | 0.2% |
| Connector Punctuation | 10 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 488 | 7.6% |
| R | 480 | 7.5% |
| N | 426 | 6.6% |
| M | 418 | 6.5% |
| S | 395 | 6.2% |
| B | 359 | 5.6% |
| L | 348 | 5.4% |
| T | 320 | 5.0% |
| G | 320 | 5.0% |
| C | 289 | 4.5% |
| Other values (16) | 2579 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 48 | |
| 0 | 40 | |
| 1 | 29 | |
| 7 | 10 | 7.4% |
| 5 | 9 | 6.6% |
Space Separator
| Value | Count | Frequency (%) |
| U+00A0 (no-break space) | 10 | |
| U+0020 (space) | 10 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 10 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6422 | |
| Common | 176 | 2.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 488 | 7.6% |
| R | 480 | 7.5% |
| N | 426 | 6.6% |
| M | 418 | 6.5% |
| S | 395 | 6.2% |
| B | 359 | 5.6% |
| L | 348 | 5.4% |
| T | 320 | 5.0% |
| G | 320 | 5.0% |
| C | 289 | 4.5% |
| Other values (16) | 2579 |
Common
| Value | Count | Frequency (%) |
| 9 | 48 | |
| 0 | 40 | |
| 1 | 29 | |
| U+00A0 (no-break space) | 10 | 5.7% |
| , | 10 | 5.7% |
| 7 | 10 | 5.7% |
| _ | 10 | 5.7% |
| U+0020 (space) | 10 | 5.7% |
| 5 | 9 | 5.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6588 | |
| None | 10 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 488 | 7.4% |
| R | 480 | 7.3% |
| N | 426 | 6.5% |
| M | 418 | 6.3% |
| S | 395 | 6.0% |
| B | 359 | 5.4% |
| L | 348 | 5.3% |
| T | 320 | 4.9% |
| G | 320 | 4.9% |
| C | 289 | 4.4% |
| Other values (24) | 2745 |
None
| Value | Count | Frequency (%) |
| U+00A0 (no-break space) | 10 |
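
The character statistics above point to one PartnerISO value that is not a three-character code: it contains digits, a comma, and a no-break space. One way to flag such entries before analysis is a simple pattern check. The sketch below is illustrative only. The accepted character set and the helper name `find_malformed_codes` are assumptions, and the sample list is a hypothetical subset of the column:

```python
import re

# Assumption: a well-formed partner code is exactly three ASCII characters
# drawn from A-Z, 0-9, underscore, or space.
CODE_PATTERN = re.compile(r"[A-Z0-9_ ]{3}")


def find_malformed_codes(codes):
    """Return the distinct values that do not look like three-character codes."""
    return sorted({c for c in codes if not CODE_PATTERN.fullmatch(c)})


# Hypothetical sample: five codes from the Sample table plus a
# seven-character value containing a no-break space (U+00A0).
sample = ["W00", "AFG", "ALB", "DZA", "AND", "19,00\u00a0F"]
print(find_malformed_codes(sample))  # ['19,00\xa0F']
```
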
Cifvalue
Real number (ℝ)
| Distinct | 2184 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.8120334 × 10¹⁰ |
| Minimum | 9 |
| Maximum | 2.6843627 × 10¹² |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 9 |
|---|---|
| 5-th percentile | 8953.5 |
| Q1 | 18730681 |
| median | 2.8651095 × 10⁸ |
| Q3 | 3.8242945 × 10⁹ |
| 95-th percentile | 5.302828 × 10¹⁰ |
| Maximum | 2.6843627 × 10¹² |
| Range | 2.6843627 × 10¹² |
| Interquartile range (IQR) | 3.8055638 × 10⁹ |
Descriptive statistics
| Standard deviation | 1.3730537 × 10¹¹ |
|---|---|
| Coefficient of variation (CV) | 7.5774194 |
| Kurtosis | 216.21251 |
| Mean | 1.8120334 × 10¹⁰ |
| Median Absolute Deviation (MAD) | 2.8648118 × 10⁸ |
| Skewness | 14.312647 |
| Sum | 3.9611051 × 10¹³ |
| Variance | 1.8852766 × 10²² |
| Monotonicity | Not monotonic |
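
Several Cifvalue figures are derived from others in these tables: the range is maximum minus minimum, the IQR is Q3 minus Q1, and the coefficient of variation is the standard deviation divided by the mean. A minimal sketch of those relationships, with the report's values read as powers of ten (for example, 2.6843627 times ten to the twelfth for the maximum):

```python
# Figures copied from the Cifvalue quantile and descriptive tables.
minimum, maximum = 9, 2.6843627e12
q1, q3 = 18730681, 3.8242945e9
mean, std = 1.8120334e10, 1.3730537e11

value_range = maximum - minimum
iqr = q3 - q1
cv = std / mean

print(f"{value_range:.7e}")  # 2.6843627e+12
print(f"{iqr:.7e}")          # 3.8055638e+09
print(f"{cv:.4f}")           # 7.5774
```

A standard deviation more than seven times the mean, together with a median roughly 63 times smaller than the mean, reflects the strong right skew reported for this variable.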
| Value | Count | Frequency (%) |
| 2753 | 2 | 0.1% |
| 71 | 2 | 0.1% |
| 1.818199228 × 10¹² | 1 | < 0.1% |
| 3421521148 | 1 | < 0.1% |
| 21774777 | 1 | < 0.1% |
| 33160256 | 1 | < 0.1% |
| 40063 | 1 | < 0.1% |
| 2172086000 | 1 | < 0.1% |
| 81763041 | 1 | < 0.1% |
| 2825845569 | 1 | < 0.1% |
| Other values (2174) | 2174 |
| Value | Count | Frequency (%) |
| 9 | 1 | |
| 13 | 1 | |
| 31 | 1 | |
| 33 | 1 | |
| 34 | 1 | |
| 39 | 1 | |
| 45 | 1 | |
| 71 | 2 | |
| 72 | 1 | |
| 76 | 1 |
| Value | Count | Frequency (%) |
| 2.684362679 × 10¹² | 1 | |
| 2.133605397 × 10¹² | 1 | |
| 2.079285499 × 10¹² | 1 | |
| 2.069567865 × 10¹² | 1 | |
| 1.959234625 × 10¹² | 1 | |
| 1.949992315 × 10¹² | 1 | |
| 1.843792939 × 10¹² | 1 | |
| 1.818199228 × 10¹² | 1 | |
| 1.679564325 × 10¹² | 1 | |
| 1.587920688 × 10¹² | 1 |
PrimaryValue
Real number (ℝ)
| Distinct | 2184 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.8120334 × 10¹⁰ |
| Minimum | 9 |
| Maximum | 2.6843627 × 10¹² |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 9 |
|---|---|
| 5-th percentile | 8953.5 |
| Q1 | 18730681 |
| median | 2.8651095 × 10⁸ |
| Q3 | 3.8242945 × 10⁹ |
| 95-th percentile | 5.302828 × 10¹⁰ |
| Maximum | 2.6843627 × 10¹² |
| Range | 2.6843627 × 10¹² |
| Interquartile range (IQR) | 3.8055638 × 10⁹ |
Descriptive statistics
| Standard deviation | 1.3730537 × 10¹¹ |
|---|---|
| Coefficient of variation (CV) | 7.5774194 |
| Kurtosis | 216.21251 |
| Mean | 1.8120334 × 10¹⁰ |
| Median Absolute Deviation (MAD) | 2.8648118 × 10⁸ |
| Skewness | 14.312647 |
| Sum | 3.9611051 × 10¹³ |
| Variance | 1.8852766 × 10²² |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2753 | 2 | 0.1% |
| 71 | 2 | 0.1% |
| 1.818199228 × 10¹² | 1 | < 0.1% |
| 3421521148 | 1 | < 0.1% |
| 21774777 | 1 | < 0.1% |
| 33160256 | 1 | < 0.1% |
| 40063 | 1 | < 0.1% |
| 2172086000 | 1 | < 0.1% |
| 81763041 | 1 | < 0.1% |
| 2825845569 | 1 | < 0.1% |
| Other values (2174) | 2174 |
| Value | Count | Frequency (%) |
| 9 | 1 | |
| 13 | 1 | |
| 31 | 1 | |
| 33 | 1 | |
| 34 | 1 | |
| 39 | 1 | |
| 45 | 1 | |
| 71 | 2 | |
| 72 | 1 | |
| 76 | 1 |
| Value | Count | Frequency (%) |
| 2.684362679 × 10¹² | 1 | |
| 2.133605397 × 10¹² | 1 | |
| 2.079285499 × 10¹² | 1 | |
| 2.069567865 × 10¹² | 1 | |
| 1.959234625 × 10¹² | 1 | |
| 1.949992315 × 10¹² | 1 | |
| 1.843792939 × 10¹² | 1 | |
| 1.818199228 × 10¹² | 1 | |
| 1.679564325 × 10¹² | 1 | |
| 1.587920688 × 10¹² | 1 |
Key
Categorical
HIGH CARDINALITY  UNIFORM  UNIQUE 
| Distinct | 2186 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.1 KiB |
| W00_2012 | 1 |
|---|---|
| PNG_2018 | 1 |
| NOR_2018 | 1 |
| FSM_2018 | 1 |
| MHL_2018 | 1 |
| Other values (2181) |
Length
| Max length | 12 |
|---|---|
| Median length | 8 |
| Mean length | 8.0182983 |
| Min length | 8 |
Characters and Unicode
| Total characters | 17528 |
|---|---|
| Distinct characters | 40 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 2186 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | W00_2012 |
|---|---|
| 2nd row | AFG_2012 |
| 3rd row | ALB_2012 |
| 4th row | DZA_2012 |
| 5th row | AND_2012 |
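
The sampled values suggest that Key is PartnerISO, an underscore, and the four-digit year, which would also explain why its mean length (8.0182983) is exactly 5 more than that of PartnerISO (3.0182983). The sketch below illustrates that assumed construction and the uniqueness check behind the "Unique" alert. The helper `make_key` and the example rows are hypothetical:

```python
# Assumption: Key = PartnerISO + "_" + RefYear, as the sampled values
# ("W00_2012", "AFG_2012", ...) suggest. make_key is a hypothetical helper.
def make_key(partner_iso: str, ref_year: int) -> str:
    return f"{partner_iso}_{ref_year}"


# Hypothetical rows: (PartnerISO, RefYear) pairs.
rows = [("W00", 2012), ("AFG", 2012), ("ALB", 2012), ("W00", 2013)]
keys = [make_key(partner, year) for partner, year in rows]

print(keys)  # ['W00_2012', 'AFG_2012', 'ALB_2012', 'W00_2013']

# A key column identifies rows only if no value repeats.
print(len(set(keys)) == len(keys))  # True
```
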
Common Values
| Value | Count | Frequency (%) |
| W00_2012 | 1 | < 0.1% |
| PNG_2018 | 1 | < 0.1% |
| NOR_2018 | 1 | < 0.1% |
| FSM_2018 | 1 | < 0.1% |
| MHL_2018 | 1 | < 0.1% |
| PLW_2018 | 1 | < 0.1% |
| PAK_2018 | 1 | < 0.1% |
| PAN_2018 | 1 | < 0.1% |
| PRY_2018 | 1 | < 0.1% |
| TGO_2018 | 1 | < 0.1% |
| Other values (2176) | 2176 |
Words
| Value | Count | Frequency (%) |
| 19,00 | 10 | 0.5% |
| x | 10 | 0.5% |
| w00_2012 | 1 | < 0.1% |
| bih_2012 | 1 | < 0.1% |
| bel_2012 | 1 | < 0.1% |
| cpv_2012 | 1 | < 0.1% |
| brb_2012 | 1 | < 0.1% |
| arm_2012 | 1 | < 0.1% |
| bgd_2012 | 1 | < 0.1% |
| bhr_2012 | 1 | < 0.1% |
| Other values (2178) | 2178 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2843 | |
| 0 | 2445 | |
| _ | 2196 | |
| 1 | 1996 | 11.4% |
| A | 488 | 2.8% |
| R | 480 | 2.7% |
| N | 426 | 2.4% |
| M | 418 | 2.4% |
| S | 395 | 2.3% |
| B | 359 | 2.0% |
| Other values (30) | 5482 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8880 | |
| Uppercase Letter | 6422 | |
| Connector Punctuation | 2196 | 12.5% |
| Space Separator | 20 | 0.1% |
| Other Punctuation | 10 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 488 | 7.6% |
| R | 480 | 7.5% |
| N | 426 | 6.6% |
| M | 418 | 6.5% |
| S | 395 | 6.2% |
| B | 359 | 5.6% |
| L | 348 | 5.4% |
| G | 320 | 5.0% |
| T | 320 | 5.0% |
| C | 289 | 4.5% |
| Other values (16) | 2579 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2843 | |
| 0 | 2445 | |
| 1 | 1996 | |
| 9 | 268 | 3.0% |
| 7 | 229 | 2.6% |
| 5 | 224 | 2.5% |
| 8 | 220 | 2.5% |
| 3 | 219 | 2.5% |
| 6 | 219 | 2.5% |
| 4 | 217 | 2.4% |
Space Separator
| Value | Count | Frequency (%) |
| U+00A0 (no-break space) | 10 | |
| U+0020 (space) | 10 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2196 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11106 | |
| Latin | 6422 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 488 | 7.6% |
| R | 480 | 7.5% |
| N | 426 | 6.6% |
| M | 418 | 6.5% |
| S | 395 | 6.2% |
| B | 359 | 5.6% |
| L | 348 | 5.4% |
| G | 320 | 5.0% |
| T | 320 | 5.0% |
| C | 289 | 4.5% |
| Other values (16) | 2579 |
Common
| Value | Count | Frequency (%) |
| 2 | 2843 | |
| 0 | 2445 | |
| _ | 2196 | |
| 1 | 1996 | |
| 9 | 268 | 2.4% |
| 7 | 229 | 2.1% |
| 5 | 224 | 2.0% |
| 8 | 220 | 2.0% |
| 3 | 219 | 2.0% |
| 6 | 219 | 2.0% |
| Other values (4) | 247 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17518 | |
| None | 10 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2843 | |
| 0 | 2445 | |
| _ | 2196 | |
| 1 | 1996 | 11.4% |
| A | 488 | 2.8% |
| R | 480 | 2.7% |
| N | 426 | 2.4% |
| M | 418 | 2.4% |
| S | 395 | 2.3% |
| B | 359 | 2.0% |
| Other values (29) | 5472 |
None
| Value | Count | Frequency (%) |
| U+00A0 (no-break space) | 10 |
Country Code
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 201 |
|---|---|
| Distinct (%) | 10.1% |
| Missing | 202 |
| Missing (%) | 9.2% |
| Memory size | 139.6 KiB |
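
The reported "Distinct (%)" appears to be taken over non-missing values rather than over all 2186 rows: 201 / 2186 would give 9.2%, whereas 201 / (2186 - 202) gives the reported 10.1%. The sketch below applies the same denominator to several columns, using the distinct and missing counts from their summaries in this report:

```python
n_rows = 2186

# (distinct, missing) pairs copied from the per-variable summaries.
columns = {
    "Country Code": (201, 202),
    "gdp": (1984, 202),
    "j": (188, 326),
    "dist": (185, 356),
    "sum_pos_tweets": (200, 1311),
}

# Distinct values as a share of the rows that actually hold a value.
distinct_pct = {
    name: 100 * distinct / (n_rows - missing)
    for name, (distinct, missing) in columns.items()
}

for name, pct in distinct_pct.items():
    print(f"{name}: {pct:.1f}%")
```

Each result matches the "Distinct (%)" shown for that column (10.1%, 100.0%, 10.1%, 10.1%, and 22.9%).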
| TLS | 10 |
|---|---|
| NLD | 10 |
| CUW | 10 |
| ABW | 10 |
| NCL | 10 |
| Other values (196) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 5952 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AFG |
|---|---|
| 2nd row | ALB |
| 3rd row | DZA |
| 4th row | AND |
| 5th row | AGO |
Common Values
| Value | Count | Frequency (%) |
| TLS | 10 | 0.5% |
| NLD | 10 | 0.5% |
| CUW | 10 | 0.5% |
| ABW | 10 | 0.5% |
| NCL | 10 | 0.5% |
| VUT | 10 | 0.5% |
| NZL | 10 | 0.5% |
| NIC | 10 | 0.5% |
| NER | 10 | 0.5% |
| NGA | 10 | 0.5% |
| Other values (191) | 1884 | |
| (Missing) | 202 | 9.2% |
Words
| Value | Count | Frequency (%) |
| tls | 10 | 0.5% |
| aus | 10 | 0.5% |
| ben | 10 | 0.5% |
| bhs | 10 | 0.5% |
| alb | 10 | 0.5% |
| dza | 10 | 0.5% |
| and | 10 | 0.5% |
| ago | 10 | 0.5% |
| atg | 10 | 0.5% |
| aze | 10 | 0.5% |
| Other values (191) | 1884 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 460 | 7.7% |
| R | 447 | 7.5% |
| N | 413 | 6.9% |
| M | 392 | 6.6% |
| L | 339 | 5.7% |
| S | 335 | 5.6% |
| B | 328 | 5.5% |
| T | 306 | 5.1% |
| G | 299 | 5.0% |
| C | 278 | 4.7% |
| Other values (16) | 2355 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5952 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 460 | 7.7% |
| R | 447 | 7.5% |
| N | 413 | 6.9% |
| M | 392 | 6.6% |
| L | 339 | 5.7% |
| S | 335 | 5.6% |
| B | 328 | 5.5% |
| T | 306 | 5.1% |
| G | 299 | 5.0% |
| C | 278 | 4.7% |
| Other values (16) | 2355 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5952 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 460 | 7.7% |
| R | 447 | 7.5% |
| N | 413 | 6.9% |
| M | 392 | 6.6% |
| L | 339 | 5.7% |
| S | 335 | 5.6% |
| B | 328 | 5.5% |
| T | 306 | 5.1% |
| G | 299 | 5.0% |
| C | 278 | 4.7% |
| Other values (16) | 2355 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5952 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 460 | 7.7% |
| R | 447 | 7.5% |
| N | 413 | 6.9% |
| M | 392 | 6.6% |
| L | 339 | 5.7% |
| S | 335 | 5.6% |
| B | 328 | 5.5% |
| T | 306 | 5.1% |
| G | 299 | 5.0% |
| C | 278 | 4.7% |
| Other values (16) | 2355 |
year
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 202 |
| Missing (%) | 9.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2016.4743 |
| Minimum | 2012 |
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 2012 |
|---|---|
| 5-th percentile | 2012 |
| Q1 | 2014 |
| median | 2016 |
| Q3 | 2019 |
| 95-th percentile | 2021 |
| Maximum | 2021 |
| Range | 9 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.8644524 |
|---|---|
| Coefficient of variation (CV) | 0.0014205251 |
| Kurtosis | -1.220005 |
| Mean | 2016.4743 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.0094079217 |
| Sum | 4000685 |
| Variance | 8.2050878 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 2014 | 201 | |
| 2013 | 200 | |
| 2015 | 200 | |
| 2012 | 199 | |
| 2016 | 199 | |
| 2017 | 199 | |
| 2018 | 199 | |
| 2019 | 198 | |
| 2020 | 197 | |
| 2021 | 192 | |
| (Missing) | 202 |
| Value | Count | Frequency (%) |
| 2012 | 199 | |
| 2013 | 200 | |
| 2014 | 201 | |
| 2015 | 200 | |
| 2016 | 199 | |
| 2017 | 199 | |
| 2018 | 199 | |
| 2019 | 198 | |
| 2020 | 197 | |
| 2021 | 192 |
| Value | Count | Frequency (%) |
| 2021 | 192 | |
| 2020 | 197 | |
| 2019 | 198 | |
| 2018 | 199 | |
| 2017 | 199 | |
| 2016 | 199 | |
| 2015 | 200 | |
| 2014 | 201 | |
| 2013 | 200 | |
| 2012 | 199 |
gdp
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 1984 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 202 |
| Missing (%) | 9.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.0900515 × 10¹¹ |
| Minimum | 36811660 |
| Maximum | 2.3315081 × 10¹³ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 36811660 |
|---|---|
| 5-th percentile | 7.8173935 × 10⁸ |
| Q1 | 6.8861102 × 10⁹ |
| median | 3.013949 × 10¹⁰ |
| Q3 | 2.0627499 × 10¹¹ |
| 95-th percentile | 1.7200859 × 10¹² |
| Maximum | 2.3315081 × 10¹³ |
| Range | 2.3315044 × 10¹³ |
| Interquartile range (IQR) | 1.9938888 × 10¹¹ |
Descriptive statistics
| Standard deviation | 1.7353012 × 10¹² |
|---|---|
| Coefficient of variation (CV) | 4.2427367 |
| Kurtosis | 89.533944 |
| Mean | 4.0900515 × 10¹¹ |
| Median Absolute Deviation (MAD) | 2.8467168 × 10¹⁰ |
| Skewness | 8.8836065 |
| Sum | 8.1146622 × 10¹⁴ |
| Variance | 3.0112701 × 10²⁴ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.368206225 × 10¹⁰ | 1 | < 0.1% |
| 401932300 | 1 | < 0.1% |
| 4.369996926 × 10¹¹ | 1 | < 0.1% |
| 4.217392102 × 10¹¹ | 1 | < 0.1% |
| 1.280866053 × 10¹⁰ | 1 | < 0.1% |
| 1.302523991 × 10¹⁰ | 1 | < 0.1% |
| 2.11953111 × 10¹¹ | 1 | < 0.1% |
| 914736985.4 | 1 | < 0.1% |
| 9846922416 | 1 | < 0.1% |
| 3202234637 | 1 | < 0.1% |
| Other values (1974) | 1974 | |
| (Missing) | 202 | 9.2% |
| Value | Count | Frequency (%) |
| 36811659.53 | 1 | |
| 38617493.72 | 1 | |
| 38759689.92 | 1 | |
| 39345620.21 | 1 | |
| 41629497.47 | 1 | |
| 45217657.88 | 1 | |
| 47818290.5 | 1 | |
| 54223149.11 | 1 | |
| 55054710.62 | 1 | |
| 63100961.54 | 1 |
| Value | Count | Frequency (%) |
| 2.331508056 × 10¹³ | 1 | |
| 2.138097612 × 10¹³ | 1 | |
| 2.106047361 × 10¹³ | 1 | |
| 2.053305731 × 10¹³ | 1 | |
| 1.947733655 × 10¹³ | 1 | |
| 1.869511084 × 10¹³ | 1 | |
| 1.820602074 × 10¹³ | 1 | |
| 1.773406265 × 10¹³ | 1 | |
| 1.755068017 × 10¹³ | 1 | |
| 1.684319099 × 10¹³ | 1 |
population
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 1984 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 202 |
| Missing (%) | 9.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37430837 |
| Minimum | 10444 |
| Maximum | 1.41236 × 10⁹ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 10444 |
|---|---|
| 5-th percentile | 64603.5 |
| Q1 | 1265980 |
| median | 7178445.5 |
| Q3 | 26065778 |
| 95-th percentile | 1.262216 × 10⁸ |
| Maximum | 1.41236 × 10⁹ |
| Range | 1.4123496 × 10⁹ |
| Interquartile range (IQR) | 24799798 |
Descriptive statistics
| Standard deviation | 1.4165282 × 10⁸ |
|---|---|
| Coefficient of variation (CV) | 3.7843882 |
| Kurtosis | 75.993728 |
| Mean | 37430837 |
| Median Absolute Deviation (MAD) | 6822149.5 |
| Skewness | 8.4523757 |
| Sum | 7.426278 × 10¹⁰ |
| Variance | 2.006552 × 10¹⁶ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2405680 | 1 | < 0.1% |
| 110929 | 1 | < 0.1% |
| 5311916 | 1 | < 0.1% |
| 198387623 | 1 | < 0.1% |
| 22577058 | 1 | < 0.1% |
| 6572233 | 1 | < 0.1% |
| 4900600 | 1 | < 0.1% |
| 297298 | 1 | < 0.1% |
| 271170 | 1 | < 0.1% |
| 105962 | 1 | < 0.1% |
| Other values (1974) | 1974 | |
| (Missing) | 202 | 9.2% |
| Value | Count | Frequency (%) |
| 10444 | 1 | |
| 10694 | 1 | |
| 10828 | 1 | |
| 10852 | 1 | |
| 10854 | 1 | |
| 10865 | 1 | |
| 10877 | 1 | |
| 10899 | 1 | |
| 10918 | 1 | |
| 10940 | 1 |
| Value | Count | Frequency (%) |
| 1412360000 | 1 | |
| 1411100000 | 1 | |
| 1407745000 | 1 | |
| 1407563842 | 1 | |
| 1402760000 | 1 | |
| 1396387127 | 1 | |
| 1396215000 | 1 | |
| 1387790000 | 1 | |
| 1383112050 | 1 | |
| 1379860000 | 1 |
j
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 188 |
|---|---|
| Distinct (%) | 10.1% |
| Missing | 326 |
| Missing (%) | 14.9% |
| Memory size | 136.2 KiB |
| NLD | 10 |
|---|---|
| NCL | 10 |
| VUT | 10 |
| NZL | 10 |
| NIC | 10 |
| Other values (183) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 5580 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AFG |
|---|---|
| 2nd row | ALB |
| 3rd row | DZA |
| 4th row | AND |
| 5th row | AGO |
Common Values
| Value | Count | Frequency (%) |
| NLD | 10 | 0.5% |
| NCL | 10 | 0.5% |
| VUT | 10 | 0.5% |
| NZL | 10 | 0.5% |
| NIC | 10 | 0.5% |
| NER | 10 | 0.5% |
| NGA | 10 | 0.5% |
| NOR | 10 | 0.5% |
| FSM | 10 | 0.5% |
| MHL | 10 | 0.5% |
| Other values (178) | 1760 | |
| (Missing) | 326 | 14.9% |
Words
| Value | Count | Frequency (%) |
| nld | 10 | 0.5% |
| bra | 10 | 0.5% |
| brn | 10 | 0.5% |
| slb | 10 | 0.5% |
| aut | 10 | 0.5% |
| alb | 10 | 0.5% |
| dza | 10 | 0.5% |
| and | 10 | 0.5% |
| ago | 10 | 0.5% |
| atg | 10 | 0.5% |
| Other values (178) | 1760 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 440 | 7.9% |
| R | 437 | 7.8% |
| N | 393 | 7.0% |
| M | 382 | 6.8% |
| B | 308 | 5.5% |
| G | 299 | 5.4% |
| L | 299 | 5.4% |
| T | 296 | 5.3% |
| S | 287 | 5.1% |
| C | 248 | 4.4% |
| Other values (16) | 2191 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5580 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 440 | 7.9% |
| R | 437 | 7.8% |
| N | 393 | 7.0% |
| M | 382 | 6.8% |
| B | 308 | 5.5% |
| G | 299 | 5.4% |
| L | 299 | 5.4% |
| T | 296 | 5.3% |
| S | 287 | 5.1% |
| C | 248 | 4.4% |
| Other values (16) | 2191 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5580 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 440 | 7.9% |
| R | 437 | 7.8% |
| N | 393 | 7.0% |
| M | 382 | 6.8% |
| B | 308 | 5.5% |
| G | 299 | 5.4% |
| L | 299 | 5.4% |
| T | 296 | 5.3% |
| S | 287 | 5.1% |
| C | 248 | 4.4% |
| Other values (16) | 2191 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5580 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 440 | 7.9% |
| R | 437 | 7.8% |
| N | 393 | 7.0% |
| M | 382 | 6.8% |
| B | 308 | 5.5% |
| G | 299 | 5.4% |
| L | 299 | 5.4% |
| T | 296 | 5.3% |
| S | 287 | 5.1% |
| C | 248 | 4.4% |
| Other values (16) | 2191 |
dist
Real number (ℝ)
| Distinct | 185 |
|---|---|
| Distinct (%) | 10.1% |
| Missing | 356 |
| Missing (%) | 16.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9059.3686 |
| Minimum | 955.6511 |
| Maximum | 19297.47 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 955.6511 |
|---|---|
| 5-th percentile | 3036.238 |
| Q1 | 6523.571 |
| median | 8390.566 |
| Q3 | 11903.59 |
| 95-th percentile | 14866.92 |
| Maximum | 19297.47 |
| Range | 18341.819 |
| Interquartile range (IQR) | 5380.019 |
Descriptive statistics
| Standard deviation | 3856.9611 |
|---|---|
| Coefficient of variation (CV) | 0.42574282 |
| Kurtosis | -0.29654095 |
| Mean | 9059.3686 |
| Median Absolute Deviation (MAD) | 2650.464 |
| Skewness | 0.27927191 |
| Sum | 16578645 |
| Variance | 14876149 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6164.891 | 10 | 0.5% |
| 14155.35 | 10 | 0.5% |
| 8648.526 | 10 | 0.5% |
| 8369.766 | 10 | 0.5% |
| 11041.03 | 10 | 0.5% |
| 13785.57 | 10 | 0.5% |
| 11024.17 | 10 | 0.5% |
| 11466.06 | 10 | 0.5% |
| 7031.006 | 10 | 0.5% |
| 5548.78 | 10 | 0.5% |
| Other values (175) | 1730 | |
| (Missing) | 356 | 16.3% |
| Value | Count | Frequency (%) |
| 955.6511 | 10 | |
| 1172.047 | 10 | |
| 1976.249 | 10 | |
| 1982.745 | 10 | |
| 2098.111 | 10 | |
| 2330.799 | 10 | |
| 2778.652 | 10 | |
| 2812.561 | 10 | |
| 2850.319 | 10 | |
| 3036.238 | 10 |
| Value | Count | Frequency (%) |
| 19297.47 | 10 | 0.5% |
| 19175.59 | 10 | 0.5% |
| 19079.88 | 10 | 0.5% |
| 18311.35 | 10 | 0.5% |
| 17614.3 | 10 | 0.5% |
| 17389.85 | 10 | 0.5% |
| 16666.29 | 10 | 0.5% |
| 15364.41 | 10 | 0.5% |
| 14937.48 | 10 | 0.5% |
| 14866.92 | 10 | 0.5% |
sum_pos_tweets
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 200 |
|---|---|
| Distinct (%) | 22.9% |
| Missing | 1311 |
| Missing (%) | 60.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 110.75429 |
| Minimum | 0 |
| Maximum | 4639 |
| Zeros | 141 |
| Zeros (%) | 6.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
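The percentages in this overview do not all share a denominator. The sketch below is an illustration in Python, not the profiling tool's own code. It reproduces the figures under the assumption that Missing (%) and Zeros (%) are taken over all 2186 rows while Distinct (%) is taken over the 875 non-missing rows; the helper name `pct` is hypothetical. The sum 96910 is the value reported under Descriptive statistics.

```python
def pct(part, whole):
    """Percentage rounded to one decimal place, as displayed in the report."""
    return round(100 * part / whole, 1)

n_rows = 2186                    # observations in the dataset
n_missing = 1311                 # missing sum_pos_tweets values
n_valid = n_rows - n_missing     # 875 non-missing rows

missing_pct = pct(n_missing, n_rows)   # share of all rows
zeros_pct = pct(141, n_rows)           # share of all rows
distinct_pct = pct(200, n_valid)       # share of non-missing rows (assumed)
mean = 96910 / n_valid                 # Sum divided by the non-missing count

print(missing_pct, zeros_pct, distinct_pct, round(mean, 5))
```

The same split of denominators appears consistent with the other tweet-derived columns, which share the 1311 missing rows.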
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 5 |
| Q3 | 34 |
| 95-th percentile | 595.9 |
| Maximum | 4639 |
| Range | 4639 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 386.02809 |
|---|---|
| Coefficient of variation (CV) | 3.485446 |
| Kurtosis | 53.807417 |
| Mean | 110.75429 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 6.5799895 |
| Sum | 96910 |
| Variance | 149017.68 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 141 | 6.5% |
| 1 | 117 | 5.4% |
| 2 | 61 | 2.8% |
| 3 | 56 | 2.6% |
| 4 | 40 | 1.8% |
| 5 | 27 | 1.2% |
| 6 | 22 | 1.0% |
| 7 | 15 | 0.7% |
| 14 | 14 | 0.6% |
| 8 | 14 | 0.6% |
| Other values (190) | 368 | 16.8% |
| (Missing) | 1311 | 60.0% |
| Value | Count | Frequency (%) |
| 0 | 141 | 6.5% |
| 1 | 117 | 5.4% |
| 2 | 61 | 2.8% |
| 3 | 56 | 2.6% |
| 4 | 40 | 1.8% |
| 5 | 27 | 1.2% |
| 6 | 22 | 1.0% |
| 7 | 15 | 0.7% |
| 8 | 14 | 0.6% |
| 9 | 10 | 0.5% |
| Value | Count | Frequency (%) |
| 4639 | 1 | < 0.1% |
| 3953 | 1 | < 0.1% |
| 3729 | 1 | < 0.1% |
| 3216 | 1 | < 0.1% |
| 2522 | 1 | < 0.1% |
| 2330 | 1 | < 0.1% |
| 2316 | 1 | < 0.1% |
| 2141 | 1 | < 0.1% |
| 2063 | 1 | < 0.1% |
| 2041 | 1 | < 0.1% |
count_tweets
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 280 |
|---|---|
| Distinct (%) | 32.0% |
| Missing | 1311 |
| Missing (%) | 60.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 265.83657 |
| Minimum | 1 |
| Maximum | 9943 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 16 |
| Q3 | 97.5 |
| 95-th percentile | 1281 |
| Maximum | 9943 |
| Range | 9942 |
| Interquartile range (IQR) | 94.5 |
Descriptive statistics
| Standard deviation | 891.00303 |
|---|---|
| Coefficient of variation (CV) | 3.3516947 |
| Kurtosis | 45.444824 |
| Mean | 265.83657 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 6.1454845 |
| Sum | 232607 |
| Variance | 793886.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 98 | 4.5% |
| 2 | 72 | 3.3% |
| 3 | 64 | 2.9% |
| 5 | 35 | 1.6% |
| 4 | 29 | 1.3% |
| 12 | 22 | 1.0% |
| 6 | 22 | 1.0% |
| 10 | 18 | 0.8% |
| 8 | 17 | 0.8% |
| 11 | 15 | 0.7% |
| Other values (270) | 483 | 22.1% |
| (Missing) | 1311 | 60.0% |
| Value | Count | Frequency (%) |
| 1 | 98 | 4.5% |
| 2 | 72 | 3.3% |
| 3 | 64 | 2.9% |
| 4 | 29 | 1.3% |
| 5 | 35 | 1.6% |
| 6 | 22 | 1.0% |
| 7 | 11 | 0.5% |
| 8 | 17 | 0.8% |
| 9 | 10 | 0.5% |
| 10 | 18 | 0.8% |
| Value | Count | Frequency (%) |
| 9943 | 1 | < 0.1% |
| 8541 | 1 | < 0.1% |
| 7967 | 1 | < 0.1% |
| 7440 | 1 | < 0.1% |
| 6004 | 1 | < 0.1% |
| 5668 | 1 | < 0.1% |
| 5609 | 1 | < 0.1% |
| 5570 | 1 | < 0.1% |
| 5373 | 1 | < 0.1% |
| 4949 | 1 | < 0.1% |
sum_political_tweets
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 28 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 1311 |
| Missing (%) | 60.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.804571 |
| Minimum | 0 |
| Maximum | 2836 |
| Zeros | 800 |
| Zeros (%) | 36.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 3 |
| Maximum | 2836 |
| Range | 2836 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 150.04681 |
|---|---|
| Coefficient of variation (CV) | 12.710907 |
| Kurtosis | 266.7716 |
| Mean | 11.804571 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 15.91247 |
| Sum | 10329 |
| Variance | 22514.045 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 800 | 36.6% |
| 1 | 20 | 0.9% |
| 2 | 10 | 0.5% |
| 3 | 7 | 0.3% |
| 5 | 5 | 0.2% |
| 6 | 4 | 0.2% |
| 12 | 3 | 0.1% |
| 4 | 3 | 0.1% |
| 13 | 2 | 0.1% |
| 20 | 2 | 0.1% |
| Other values (18) | 19 | 0.9% |
| (Missing) | 1311 | 60.0% |
| Value | Count | Frequency (%) |
| 0 | 800 | 36.6% |
| 1 | 20 | 0.9% |
| 2 | 10 | 0.5% |
| 3 | 7 | 0.3% |
| 4 | 3 | 0.1% |
| 5 | 5 | 0.2% |
| 6 | 4 | 0.2% |
| 7 | 2 | 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2836 | 1 | < 0.1% |
| 2480 | 1 | < 0.1% |
| 2058 | 1 | < 0.1% |
| 653 | 1 | < 0.1% |
| 644 | 1 | < 0.1% |
| 515 | 1 | < 0.1% |
| 475 | 1 | < 0.1% |
| 153 | 1 | < 0.1% |
| 67 | 1 | < 0.1% |
| 65 | 1 | < 0.1% |
sum_likes
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 319 |
|---|---|
| Distinct (%) | 36.5% |
| Missing | 1311 |
| Missing (%) | 60.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1770.5646 |
| Minimum | 0 |
| Maximum | 290050 |
| Zeros | 197 |
| Zeros (%) | 9.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 11 |
| Q3 | 154.5 |
| 95-th percentile | 3998.3 |
| Maximum | 290050 |
| Range | 290050 |
| Interquartile range (IQR) | 153.5 |
Descriptive statistics
| Standard deviation | 13523.049 |
|---|---|
| Coefficient of variation (CV) | 7.6377044 |
| Kurtosis | 283.99065 |
| Mean | 1770.5646 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 15.502566 |
| Sum | 1549244 |
| Variance | 1.8287285 × 10⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 197 | 9.0% |
| 1 | 74 | 3.4% |
| 2 | 38 | 1.7% |
| 3 | 34 | 1.6% |
| 4 | 21 | 1.0% |
| 5 | 20 | 0.9% |
| 6 | 15 | 0.7% |
| 11 | 10 | 0.5% |
| 8 | 10 | 0.5% |
| 9 | 9 | 0.4% |
| Other values (309) | 447 | 20.4% |
| (Missing) | 1311 | 60.0% |
| Value | Count | Frequency (%) |
| 0 | 197 | 9.0% |
| 1 | 74 | 3.4% |
| 2 | 38 | 1.7% |
| 3 | 34 | 1.6% |
| 4 | 21 | 1.0% |
| 5 | 20 | 0.9% |
| 6 | 15 | 0.7% |
| 7 | 8 | 0.4% |
| 8 | 10 | 0.5% |
| 9 | 9 | 0.4% |
| Value | Count | Frequency (%) |
| 290050 | 1 | < 0.1% |
| 173770 | 1 | < 0.1% |
| 150127 | 1 | < 0.1% |
| 78382 | 1 | < 0.1% |
| 56259 | 1 | < 0.1% |
| 56251 | 1 | < 0.1% |
| 50321 | 1 | < 0.1% |
| 47142 | 1 | < 0.1% |
| 31330 | 1 | < 0.1% |
| 27562 | 1 | < 0.1% |
sum_retweets
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 280 |
|---|---|
| Distinct (%) | 32.0% |
| Missing | 1311 |
| Missing (%) | 60.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 595.57257 |
| Minimum | 0 |
| Maximum | 67353 |
| Zeros | 229 |
| Zeros (%) | 10.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 7 |
| Q3 | 105 |
| 95-th percentile | 1790.2 |
| Maximum | 67353 |
| Range | 67353 |
| Interquartile range (IQR) | 105 |
Descriptive statistics
| Standard deviation | 3378.1066 |
|---|---|
| Coefficient of variation (CV) | 5.6720318 |
| Kurtosis | 211.22583 |
| Mean | 595.57257 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 12.826374 |
| Sum | 521126 |
| Variance | 11411604 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 229 | 10.5% |
| 1 | 80 | 3.7% |
| 2 | 43 | 2.0% |
| 3 | 24 | 1.1% |
| 6 | 21 | 1.0% |
| 5 | 19 | 0.9% |
| 4 | 17 | 0.8% |
| 13 | 14 | 0.6% |
| 9 | 13 | 0.6% |
| 7 | 11 | 0.5% |
| Other values (270) | 404 | 18.5% |
| (Missing) | 1311 | 60.0% |
| Value | Count | Frequency (%) |
| 0 | 229 | 10.5% |
| 1 | 80 | 3.7% |
| 2 | 43 | 2.0% |
| 3 | 24 | 1.1% |
| 4 | 17 | 0.8% |
| 5 | 19 | 0.9% |
| 6 | 21 | 1.0% |
| 7 | 11 | 0.5% |
| 8 | 10 | 0.5% |
| 9 | 13 | 0.6% |
| Value | Count | Frequency (%) |
| 67353 | 1 | < 0.1% |
| 41928 | 1 | < 0.1% |
| 32288 | 1 | < 0.1% |
| 18792 | 1 | < 0.1% |
| 17523 | 1 | < 0.1% |
| 14947 | 1 | < 0.1% |
| 13862 | 1 | < 0.1% |
| 13776 | 1 | < 0.1% |
| 13319 | 1 | < 0.1% |
| 13019 | 1 | < 0.1% |
| | RefYear | Period | Cifvalue | PrimaryValue | year | gdp | population | dist | sum_pos_tweets | count_tweets | sum_political_tweets | sum_likes | sum_retweets |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| RefYear | 1.000 | 1.000 | 0.027 | 0.027 | 1.000 | 0.022 | 0.016 | 0.002 | 0.264 | 0.212 | 0.019 | 0.439 | 0.296 |
| Period | 1.000 | 1.000 | 0.027 | 0.027 | 1.000 | 0.022 | 0.016 | 0.002 | 0.264 | 0.212 | 0.019 | 0.439 | 0.296 |
| Cifvalue | 0.027 | 0.027 | 1.000 | 1.000 | 0.035 | 0.870 | 0.691 | -0.280 | 0.517 | 0.553 | 0.327 | 0.475 | 0.522 |
| PrimaryValue | 0.027 | 0.027 | 1.000 | 1.000 | 0.035 | 0.870 | 0.691 | -0.280 | 0.517 | 0.553 | 0.327 | 0.475 | 0.522 |
| year | 1.000 | 1.000 | 0.035 | 0.035 | 1.000 | 0.022 | 0.016 | 0.002 | 0.264 | 0.212 | 0.019 | 0.439 | 0.296 |
| gdp | 0.022 | 0.022 | 0.870 | 0.870 | 0.022 | 1.000 | 0.769 | -0.250 | 0.632 | 0.677 | 0.388 | 0.552 | 0.619 |
| population | 0.016 | 0.016 | 0.691 | 0.691 | 0.016 | 0.769 | 1.000 | -0.153 | 0.499 | 0.519 | 0.371 | 0.447 | 0.494 |
| dist | 0.002 | 0.002 | -0.280 | -0.280 | 0.002 | -0.250 | -0.153 | 1.000 | -0.293 | -0.302 | -0.089 | -0.286 | -0.301 |
| sum_pos_tweets | 0.264 | 0.264 | 0.517 | 0.517 | 0.264 | 0.632 | 0.499 | -0.293 | 1.000 | 0.962 | 0.383 | 0.883 | 0.897 |
| count_tweets | 0.212 | 0.212 | 0.553 | 0.553 | 0.212 | 0.677 | 0.519 | -0.302 | 0.962 | 1.000 | 0.388 | 0.878 | 0.902 |
| sum_political_tweets | 0.019 | 0.019 | 0.327 | 0.327 | 0.019 | 0.388 | 0.371 | -0.089 | 0.383 | 0.388 | 1.000 | 0.362 | 0.390 |
| sum_likes | 0.439 | 0.439 | 0.475 | 0.475 | 0.439 | 0.552 | 0.447 | -0.286 | 0.883 | 0.878 | 0.362 | 1.000 | 0.942 |
| sum_retweets | 0.296 | 0.296 | 0.522 | 0.522 | 0.296 | 0.619 | 0.494 | -0.301 | 0.897 | 0.902 | 0.390 | 0.942 | 1.000 |
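The High correlation alerts in the overview can be related to this matrix with a simple threshold check. The sketch below is in Python and is only an illustration: the function name `high_correlations` is hypothetical, and the cutoff of 0.5 on the absolute coefficient is an assumption chosen because it reproduces the alerts listed for this dataset, not a confirmed setting of the profiling tool.

```python
def high_correlations(row, threshold=0.5):
    """Return the columns whose |r| with the row's variable meets the threshold."""
    return [col for col, r in row.items() if abs(r) >= threshold]

# Off-diagonal entries of two rows, copied from the matrix above.
sum_likes_row = {
    "RefYear": 0.439, "Period": 0.439, "Cifvalue": 0.475,
    "PrimaryValue": 0.475, "year": 0.439, "gdp": 0.552,
    "population": 0.447, "dist": -0.286, "sum_pos_tweets": 0.883,
    "count_tweets": 0.878, "sum_political_tweets": 0.362,
    "sum_retweets": 0.942,
}
dist_row = {
    "RefYear": 0.002, "Period": 0.002, "Cifvalue": -0.280,
    "PrimaryValue": -0.280, "year": 0.002, "gdp": -0.250,
    "population": -0.153, "sum_pos_tweets": -0.293,
    "count_tweets": -0.302, "sum_political_tweets": -0.089,
    "sum_likes": -0.286, "sum_retweets": -0.301,
}

print("sum_likes:", high_correlations(sum_likes_row))
print("dist:", high_correlations(dist_row))
```

Under this assumed cutoff, sum_likes is flagged against gdp and three other fields, in line with its alert, while dist is flagged against nothing, in line with the absence of a High correlation alert for it.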
| | RefYear | Period | ReporterISO | FlowDesc | PartnerISO | Cifvalue | PrimaryValue | Key | Country Code | year | gdp | population | j | dist | sum_pos_tweets | count_tweets | sum_political_tweets | sum_likes | sum_retweets |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2012 | 2012 | CHN | Import | W00 | 1.818199e+12 | 1818199227571 | W00_2012 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1 | 2012 | 2012 | CHN | Import | AFG | 5.186565e+06 | 5186565 | AFG_2012 | AFG | 2012 | 2.020357e+10 | 30466479.0 | AFG | 4180.438 | NaN | NaN | NaN | NaN | NaN |
| 2 | 2012 | 2012 | CHN | Import | ALB | 1.427209e+08 | 142720886 | ALB_2012 | ALB | 2012 | 1.231983e+10 | 2900401.0 | ALB | 7686.079 | NaN | NaN | NaN | NaN | NaN |
| 3 | 2012 | 2012 | CHN | Import | DZA | 2.311906e+09 | 2311905609 | DZA_2012 | DZA | 2012 | 2.090590e+11 | 37260563.0 | DZA | 9117.676 | NaN | NaN | NaN | NaN | NaN |
| 4 | 2012 | 2012 | CHN | Import | AND | 3.240020e+05 | 324002 | AND_2012 | AND | 2012 | 3.188809e+09 | 71013.0 | AND | 8764.593 | NaN | NaN | NaN | NaN | NaN |
| 5 | 2012 | 2012 | CHN | Import | AGO | 3.356190e+10 | 33561896917 | AGO_2012 | AGO | 2012 | 1.249982e+11 | 25188292.0 | AGO | 11769.510 | NaN | NaN | NaN | NaN | NaN |
| 6 | 2012 | 2012 | CHN | Import | ATG | 7.135100e+04 | 71351 | ATG_2012 | ATG | 2012 | 1.199948e+09 | 87674.0 | ATG | 13681.690 | NaN | NaN | NaN | NaN | NaN |
| 7 | 2012 | 2012 | CHN | Import | AZE | 2.141617e+08 | 214161731 | AZE_2012 | AZE | 2012 | 6.968394e+10 | 9295784.0 | AZE | 5520.214 | NaN | NaN | NaN | NaN | NaN |
| 8 | 2012 | 2012 | CHN | Import | ARG | 6.560806e+09 | 6560805532 | ARG_2012 | ARG | 2012 | 5.459824e+11 | 41733271.0 | ARG | 19297.470 | NaN | NaN | NaN | NaN | NaN |
| 9 | 2012 | 2012 | CHN | Import | AUS | 8.456821e+10 | 84568208584 | AUS_2012 | AUS | 2012 | 1.546892e+12 | 22733465.0 | AUS | 8956.436 | NaN | NaN | NaN | NaN | NaN |
| | RefYear | Period | ReporterISO | FlowDesc | PartnerISO | Cifvalue | PrimaryValue | Key | Country Code | year | gdp | population | j | dist | sum_pos_tweets | count_tweets | sum_political_tweets | sum_likes | sum_retweets |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2176 | 2021 | 2021 | CHN | Import | USA | 1.809719e+11 | 180971932243 | USA_2021 | USA | 2021 | 2.331508e+13 | 331893745.0 | USA | 10993.680 | NaN | NaN | NaN | NaN | NaN |
| 2177 | 2021 | 2021 | CHN | Import | BFA | 1.884526e+08 | 188452598 | BFA_2021 | BFA | 2021 | 1.973762e+10 | 22100683.0 | BFA | 11404.370 | 12.0 | 22.0 | 0.0 | 21.0 | 5.0 |
| 2178 | 2021 | 2021 | CHN | Import | URY | 3.623471e+09 | 3623470751 | URY_2021 | URY | 2021 | 5.931948e+10 | 3426260.0 | URY | 19175.590 | 1.0 | 6.0 | 0.0 | 2.0 | 0.0 |
| 2179 | 2021 | 2021 | CHN | Import | UZB | 1.540988e+09 | 1540987879 | UZB_2021 | UZB | 2021 | 6.923890e+10 | 34915100.0 | UZB | 3943.621 | NaN | NaN | NaN | NaN | NaN |
| 2180 | 2021 | 2021 | CHN | Import | VEN | 9.977931e+08 | 997793138 | VEN_2021 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 2181 | 2021 | 2021 | CHN | Import | WLF | 1.475400e+04 | 14754 | WLF_2021 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 2182 | 2021 | 2021 | CHN | Import | WSM | 6.432650e+05 | 643265 | WSM_2021 | WSM | 2021 | 8.438424e+08 | 218764.0 | WSM | 8268.319 | NaN | NaN | NaN | NaN | NaN |
| 2183 | 2021 | 2021 | CHN | Import | YEM | 4.708126e+08 | 470812557 | YEM_2021 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 2184 | 2021 | 2021 | CHN | Import | ZMB | 4.385251e+09 | 4385251435 | ZMB_2021 | ZMB | 2021 | 2.214763e+10 | 19473125.0 | ZMB | 10960.790 | 1.0 | 3.0 | 0.0 | 12.0 | 5.0 |
| 2185 | 2021 | 2021 | CHN | Import | _X | 2.060227e+09 | 2060227074 | _X _2021 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |